Difficulties in testing for covarion-like properties of sequences under the confounding influence of changing proportions of variable sites.

نویسندگان

  • Nicole Gruenheit
  • Peter J Lockhart
  • Mike Steel
  • William Martin
چکیده

The covarion (COV)-like properties of sequences are poorly described and their impact on phylogenetic analyses poorly understood. We demonstrate using simulations that, under an evolutionary model where the proportion of variable sites changes in nonadjacent lineages, log likelihood values for rates across site (RAS) and COV models become similar, making models difficult to distinguish. Further, although COV and RAS models provide a great improvement in likelihood scores over a homogeneous model with these simulated data, reconstruction accuracy of tree building is low, suggesting caution when it is suspected that proportions of variable sites differ in different evolutionary lineages. We study the performance of a recently developed contingency test that detects the presence of COV-type evolution modified for protein data. We report that if proportions of variable sites (p(var)) change in a lineage-specific manner such that their distributions in different lineages become sufficiently nonoverlapping, then the contingency test can incorrectly suggest a homogeneous model. Also of concern is the possibility of different proportions of variable sites between the groups being studied. In a study of chloroplast proteins, interpretation of the test is found to be susceptible to different partitioning of taxon groups, making the test very subjective in its implementation. Extreme intergroup differences in the extent of divergence and difference in proportions of variable sites could be contributing to this effect.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Maximum-likelihood phylogenetic analysis under a covarion-like model.

Here, a model allowing covarion-like evolution of DNA sequences is introduced. In contrast to standard representation of the distribution of evolutionary rates, this model allows the site-specific rate to vary between lineages. This is achieved by adding as few as two parameters to the widely used among-site rate variation model, namely, (1) the proportion of sites undergoing rate changes and (...

متن کامل

Modeling the covarion hypothesis of nucleotide substitution.

A "covarion" model for nucleotide substitution that allows sites to turn "on" and "off" with time was proposed in 1970 by Fitch and Markowitz. It has been argued recently that evidence supports such models over later, alternative models that postulate a static distribution of rates across sites. However, in contrast with these latter well-studied models, little is known about the analytic prope...

متن کامل

Modelling the covarion hypothesis of nucleotide substitution

A “covarion” model for nucleotide substitution which allows sites to turn “on” and “off” with time was proposed 27 years ago by Fitch and Markowitz. It has been argued recently that evidence supports such models over later, alternative models which postulate a static distribution of rates across sites. However, in contrast to these latter well-studied models, little is known about the analytic ...

متن کامل

A covariotide model explains apparent phylogenetic structure of oxygenic photosynthetic lineages.

The aims of the work were (1) to develop statistical tests to identify whether substitution takes place under a covariotide model in sequences used for phylogenetic inference and (2) to determine the influence of covariotide substitution on phylogenetic trees inferred for photosynthetic and other organisms. (Covariotide and covarion models are ones in which sites that are variable in some parts...

متن کامل

Testing the covarion hypothesis of molecular evolution.

The covarion hypothesis of molecular evolution states that the fixation of mutations may alter the probability that any given position will fix the next change. Tests of this hypothesis using the divergence of real sequences are compromised because models of rate variation among sites (e.g., the gamma version of the one-parameter equation) predict sequence divergence values similar to those for...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Molecular biology and evolution

دوره 25 7  شماره 

صفحات  -

تاریخ انتشار 2008